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We introduce and explore a new method for inferring hidden geometric coordinates of nodes in 
complex networks based on the number of common neighbors between the nodes. We compare this 
approach to the HyperMap method, which is based only on the connections (and disconnections) 
between the nodes, i.e., on the links that the nodes have (or do not have). We find that for high 
degree nodes the common-neighbors approach yields a more accurate inference than the link-based 
method, unless heuristic periodic adjustments (or “correction steps”) are used in the latter. The 
common-neighbors approach is computationally intensive, requiring 0{t*) running time to map a 
network of t nodes, versus 0(t^) in the link-based method. But we also develop a hybrid method 
with 0{t^) running time, which combines the common-neighbors and link-based approaches, and 
explore a heuristic that reduces its running time further to 0{t^), without significant reduction in 
the mapping accuracy. We apply this method to the Autonomous Systems (AS) Internet, and reveal 
how soft communities of ASes evolve over time in the similarity space. We further demonstrate the 
method’s predictive power by forecasting future links between ASes. Taken altogether, our results 
advance our understanding of how to efficiently and accurately map real networks to their latent 
geometric spaces, which is an important necessary step towards understanding the laws that govern 
the dynamics of nodes in these spaces, and the fine-grained dynamics of network connections. 
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I. INTRODUCTION 

The main premise of preferential attachment [T] is that 
popularity is attractive [5], but similarity is also attrac¬ 
tive [S]. Combined together these two attractive forces, 
popularity and similarity, have shown to form hidden hy¬ 
perbolic geometries that drive evolution of networks [1]. 
Since these geometries are hidden, effective, or latent, 
they must be inferred from the network structure. Specif¬ 
ically what must be inferred are node coordinates in these 
underlying hyperbolic spaces. Existing approaches [SlIS] 
to such inference are based on the connections (and dis¬ 
connections) between the nodes, i.e., on the links that 
the nodes have (or do not have). Connected nodes are 
attracted to each other, while disconnected nodes repel, 
and these approaches are placing nodes into a hyper¬ 
bolic space based on these attraction and repulsion forces. 
Both approaches in are based on Maximum Like¬ 
lihood Estimation. The approach in [5] embeds a given 
network topology into the hyperbolic plane by maximiz¬ 
ing the likelihood that the topology is produced by the 
equilibrium hyperbolic network model [7], while the ap¬ 
proach in embeds the network by maximizing the like¬ 
lihood that the topology is produced by the hyperbolic 
model of growing networks [4] . Both approaches produce 
similar results, even though there are fundamental dif¬ 
ferences between them. In this paper, we build on the 
latter approach [5], which is more recent and simpler to 
implement. 

The work in [4] shows that trade-offs between pop¬ 
ularity and similarity shape the structure and dynam¬ 


ics of growing complex networks, and that these trade¬ 
offs in network dynamics give rise to hyperbolic geom¬ 
etry. The growing network model in [2] is essentially a 
model of random geometric graphs growing in hyperbolic 
spaces. Synthetic graphs grown according to this simple 
model simultaneously exhibit many common structural 
and dynamical characteristics of real networks. We call 
the model in [4] the Popularity x Similarity Optimization 
(PSO) model. 

Given the ability of the PSO model to construct syn¬ 
thetic growing networks that resemble real networks 
across a wide range of structural and dynamical char¬ 
acteristics, the work in showed how to reverse this 
synthesis, and given a real network, how to map (embed) 
the network into the hyperbolic plane, in a way congruent 
with the PSO model. Specihcally, the mapping method 
of [S], called HyperMap, replays the network’s geometric 
growth, estimating at each time step the hyperbolic co¬ 
ordinates of new nodes by maximizing the likelihood of 
the network snapshot in the model. In the inferred po¬ 
lar coordinates of nodes, the radial coordinate r can be 
associated with node popularity, while the angular coor¬ 
dinate 9 is the node coordinate in the similarity space 
abstracted by a circle. HyperMap has been applied to 
the Autonomous Systems (AS) topology of the real In¬ 
ternet in [5], where it was shown that: (i) the method can 
identify soft communities of ASes belonging to the same 
geographic region, even though the method is completely 
geography-agnostic; (ii) the method can predict missing 
links between ASes with high precision, outperforming 
popular existing methods; and (iii) that the method can 
construct a highly navigable Internet map—greedy for- 
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warding in the map can reach destinations with more 
than 90% success probability and low stretch. 

Here we introduce and explore a new method for in¬ 
ferring the node similarity coordinates, and release its 
implementation to public [5]. This method differs from 
the one in [S] in that it is not based on the links that the 
nodes have or do not have. Instead, it is based on the 
number of common neighbors between the nodes. The 
method is inspired by the observation that the number 
of common neighbors between two nodes is a measure of 
similarity between the nodes; in general, the more the 
common neighbors between two nodes the more similar 
the two nodes are, i.e., the smaller their similarity dis¬ 
tance Eillo]. We call the approach in the link-based 
approach, and the approach considered here the common- 
neighbors approach. We compare the two approaches and 
find that for high degree nodes the common-neighbors 
approach yields a more accurate inference than the link- 
based method, unless heuristic periodic adjustments (or 
“correction steps” 0 ) are used in the latter. On the other 
hand, the common-neighbors approach is computation¬ 
ally intensive, requiring 0{t^) running time to map a net¬ 
work of t nodes, versus 0{t^) in the link-based method. 

Based on these above observations, we then introduce 
a hybrid method with 0{t^) running time, which com¬ 
bines the common-neighbors and link-based approaches, 
and explore a heuristic that can reduce its running time 
further to 0{t^), without significantly sacrificing the em¬ 
bedding accuracy. We apply this method to snapshots 
of the real Internet to reveal how soft communities of 
ASes evolve over time in the similarity space. We also 
demonstrate the method’s predictive power by forecast¬ 
ing future links between ASes. Taken altogether, our re¬ 
sults advance our understanding of how to efficiently and 
accurately map real networks to their latent hyperbolic 
spaces, which is an important necessary step towards un¬ 
derstanding the laws that govern the dynamics of nodes 
in these spaces, and the fine-grained dynamics of network 
connections. 

The rest of the paper is organized as follows. In Sec¬ 
tion we review the extended PSO (E-PSO) model 
from and the details of the HyperMap method that 
we need in this paper. In Section m we show how 
the angular coordinates of nodes can be inferred using 
the common-neighbors approach, and describe the hy¬ 
brid method. In Section |IVj we describe how to speedup 
the method, and in Section |Vj we validate our results in 
synthetic networks. In Section |VI[ we apply the hybrid 
method to the AS Internet. In Section VII[ we discuss 
other relevant work, and in Section VIII we conclude 
with a discussion of open problems and future work. 


II. PRELIMINARIES 

The E-PSO model of growing networks has been intro¬ 
duced in [S] for HyperMap development purposes. As its 
name suggests, this model is a modification of the PSO 


model in [3]. The E-PSO model constructs growing net¬ 
works using external links only, while being equivalent to 
the generalized PSO model in [1] that uses both external 
and internal links [5]. External links connect new nodes 
to existing nodes, while internal links appear between ex¬ 
isting nodes only. Given a single snapshot of the topology 
of a real network, there is no way to distinguish external 
links from internal links. The E-PSO model sidesteps this 
obstacle, and helps to map a given real network topology 
by replaying its geometric growth, treating all links in the 
topology as if they were external [5]. Below, we first re¬ 
view the E-PSO model, and then proceed to HyperMap, 
which is based on this model. We limit the exposition 
only to the basic details that we need in the rest of the 
paper. 


A. The E-PSO model 

The E-PSO model has five input parameters m ^ 0, 
A ^ 0, /3 S (0,1], T G [0,1), and C > 0. Parameters m 
and L are the rates at which external and internal links 
appear. (We will explain shortly how we compute them 
in a real network.) These two parameters appear inside 
Eq. 0 below, and define the average node degree in the 
network, k « 2{m -\- L). Parameter f3 defines the expo¬ 
nent 7 = I-|-l//3^2of the power-law degree distribution 
P{k) ~ k~^ in the network. Temperature T controls the 
average clustering c m in the network, which is max¬ 
imized at T = 0 and nearly linearly decreases to zero 
with T G [0,1). Parameter f = y/—K where K is the 
curvature of the hyperbolic plane. As it is going to be ex¬ 
plained, changing f rescales the node radial coordinates. 
This rescaling parameter does not affect any topological 
properties of networks generated by the model. There¬ 
fore, it can be set to any value in the model, e.g., C = 
without loss of generality. Having these parameters and 
the final size of the network t > 0 specified, the E-PSO 
model constructs a growing scale-free network up to t 
nodes according to the following E-PSO model defini¬ 
tion: 

(1) initially the network is empty; 

(2) coordinate assignment and update: 

(a) at time i = I,2,...,t, new node i is added 
to the hyperbolic plane at polar coordinates 
{ri,0i), where radial coordinate = |lnf, 
while the angular coordinate Oi is sampled uni¬ 
formly at random from [0,27r]; 

(b) each existing node j = 1,2,... ,i — 1, moves 
increasing its radial coordinate according to 
rjii) = Idrj + (1 - PYi, 

(3) creation of edges: node i connects to each existing 
nodej = I, 2,..., i — 1 with probability pij = p(xij) 
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given by 


p{x,j) 


1 

1 -|- 


( 1 ) 


In the last expression, Xij is the hyperbolic distance be¬ 
tween nodes i and j [T^ . 


cosh (^Xij 
where 9ij 
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with li = y^(l —* Equation Q is derived from 

the condition that the expected number of old nodes j < i 
that i connects to, denoted by fhi{t), is 


i(t) = m + 


2L(l-/3) 


(l_i-(l-/3))2(2/5_l) 
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The radial coordinate of a node abstracts its popular¬ 
ity. The smaller the radial coordinate of a node, the more 
popular the node is, and the more likely it attracts new 
connections. The angular distance between two nodes 
abstracts their similarity. The smaller this distance, the 
more similar the two nodes are, and the more likely they 
are connected. The hyperbolic distance Xij is then a 
single-metric representation of a combination of the two 
attractiveness attributes, radial popularity and angular 
similarity. The connection probability p{xij) is a decreas¬ 
ing function of Xij, meaning that new connections take 
place by optimizing trade-offs between popularity and 
similarity [4]. It has been shown [5] that the E-PSO 
model can reproduce not only the degree distribution and 
clustering of real networks like the AS Internet, but also 
several other important properties. Given the ability of 
the model to construct growing synthetic networks that 
resemble real networks, [S] then showed how to reverse 
this synthesis, and given a real network, how to map 
(embed) the network into the hyperbolic plane, in a way 
congruent with the E-PSO model. The mapping method, 
HyperMap, is described next. 


B. HyperMap 

HyperMap is based on Maximum Likelihood Estima¬ 
tion (MLE) and is fully specified in Fig.H On its input it 
takes the network adjacency matrix = aji = I if 

there is a link between nodes i and j, and = a^i = 0 
otherwise), and the network parameters m, L,j,T,C^. It 


then computes radial and angular coordinates ri(t)^9i, 
for all nodes i ^ t in the network. 

HyperMap first estimates the MLE appearance (or 
birth) times of nodes i = 1,2,... ,t. As shown in [^, 
the higher the degree of a node in the E-PSO model the 
earlier its MLE appearance time. Therefore, HyperMap 
uses the following procedure for finding the MLE of the 
node appearance times in a given network with t nodes. 
It sorts all nodes in the decreasing order of their degrees 
ki > k 2 > ... > kt, with ties broken arbitrarily, and sets 
their MLE appearance times i = 1,2,... ,t in the same 
order. That is, the node with the largest degree fci is 
expected to appear first, i = 1, the second largest degree 
node ^2 appeared second, i = 2, and so on. The node 
born at time i is called node i. 

Having a sequence of MLE node birth times, Hyper¬ 
Map replays the hyperbolic growth of the network in 
accordance with the E-PSO model as follows. When a 
node is born at time I ^ ^ t, it is assigned an ini¬ 

tial radial coordinate Xi = | In*, and every existing node 
j < i moves increasing its radial coordinate according to 
Xj{i) = fixj -f (1 — fi)xi. The method assigns to a new 
node i > 1 the angular coordinate 0^ that maximizes its 
local likelihood 

^1= W_ - Pixij)]^~°'' ■ ( 5 ) 

l^j<i 


This likelihood is a function of 0*, since Xij depends on 
6i, see Eq. ([^, p{xij) depends on Xij, see Eq. Q, and 
depends on p{xij). The product in Eq. ([^ goes over all 
the old nodes j < i. The likelihood is called local as 
it depends only on the connections (and disconnections) 
between new node i and existing nodes j < i. For exam¬ 
ple, if new node * = 4 is connected to nodes 1, 2 but not 
to node 3, i.e., 041 = 1, 042 = 1 ,043 = 0, then would 
be = P{x 4 :i)p{xi 2 ){^—p{x 43 )). We use the subscript L 
to emphasize that depends on the links between new 
node i and existing nodes j < i, i.e., it is a link-based 
approach. In the next section, we will derive an alterna¬ 
tive local likelihood, which depends on the number 
of common neighbors between new node i and existing 
nodes j < i. 

The maximization of 0\^ is performed numerically, by 
sampling the likelihood at different values of 6 in 
[0,27r] separated by intervals A0 = 4, and then setting 
Qi to the value of 6 that yields the largest value of C\^. 
Since, to compute for a given 9 we need to compute 
the connection probability between node i and all exist¬ 
ing nodes j < i, we need a total of 0 (*^) steps to perform 
the maximization. If there are t nodes in total, Hyper¬ 
Map needs 0{t^) running time to map the full network. 

Specifying input paxanietexs. Parameter C > 0 can be 
set to any value, e.g., C = 1 - A.s mentioned, changing 
the value of this parameter corresponds to radial coor¬ 
dinate rescaling. Specifically, the radial coordinates of 
nodes will be rescaled by the factor f, since as can be 
seen by steps 5 and 6 in Fig. at the hnal time i = t. 
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1: Sort node degrees in decreasing order ki > k 2 > ■ ■ ■ > kt 
with ties broken arbitrarily. 

2: Call node i, i = 1,2,... ,t, the node with degree ki. 

3: Node i = 1 is born, assign to it initial radial coordinate 
ri = 0 and random angular coordinate 9i € [0, 27r]. 

4: for i = 2 to t do 

5: Node i is born, assign to it initial radial coordinate 

n = I Ini. 

6: Increase the radial coordinate of every existing node 

j < i according to rj{i) = f5rj + (1 — P)ri. 

7: Assign to node i angular coordinate 6i maximizing C\^ 

given by Equation Q. 

8 : end for 


FIG. 1: The HyperMap Embedding Algorithm. 


rj{t) = Pvj + (1 — P)rt = ^ Inj + Inf, j < t. Fur¬ 

thermore, the likelihood C]^ in Eq. ([^ does not depend on 
C, as it cancels out in the connection probability p{xij) in 
Eq.Q. That is, different values of C, will yield exactly the 
same angular coordinates. Parameter m can be obtained 
from historical data of the evolution of the network. If 
such data is available, then m is the average number of 
connections that nodes have once they first appear in the 
data. If no historical data are available, m can be set, as 
an approximation, to the minimum observed node degree 
in the network. Given the average node degree k in the 
network, and knowing m and k, we get L = . The 

power law exponent 7 can be obtained from the degree 
distribution of the network, while parameter T is found 
experimentally [S]. We emphasize that the parameters 
for HyperMap come directly from the observation of the 
real network. With these 5 parameters {m, L,j,T,(), 
and the network adjacency matrix aij, HyperMap infers 
2t hyperbolic node coordinates in a network of t nodes (a 
radial and angular coordinate for each node), and conse¬ 
quently, O(t^) hyperbolic distances between nodes. 


III. INFERRING NODE SIMILARITY 
COORDINATES USING THE NUMBER OF 
COMMON NEIGHBORS 


We now show how the angular (similarity) coordinates 
of nodes can be inferred using the number of common 


neighbors between new and old nodes, instead of the con¬ 
nections and disconnections between them. Specifically, 
we first derive an alternative local likelihood, which 
uses the observed number of common neighbors between 
each new node i and each existing node j < i at final 
time t. Then, we use this likelihood in place of in 
Equation (j^ in order to infer the angular coordinate of 
each node. 

In Section [Vj we show that for small i’s, i.e., for nodes 
that appear at early MLE times, which are the high de¬ 
gree nodes, jCqjh yields a more accurate angular coordi¬ 
nate inference than This is because, for all node pairs 
i,J, j < i, utilizes more information, since it uses 
the final number of common neighbors between the pairs. 
That is, it considers the full network adjacency matrix, 
i.e., the network adjacency matrix at the final time t, and 
uses the number of common neighbors between the node 
pairs at that time. In contrast, Cf in Eq. ([^ uses less 
information, since at each time i ^ t it considers only 
the connections and disconnections between node i and 
old nodes j < i. To derive we first need to compute 
the distribution of the number of common neighbors be¬ 
tween node pairs in the E-PSO model, which is the task 
we perform next. 


A. Distribution of the number of common 
neighbors 


Consider a network that has grown up to t nodes ac¬ 
cording to E-PSO (Section HA I, where nodes are num¬ 
bered according to the order they appear. Consider two 
nodes i, j with j < i and a third node k. The initial radial 
coordinates of these nodes are U = | Inz, rj = | Inj and 
Tfc = |lnfc. We first need to find p{i,j,6i,9j-,k), which 
is the probability that i and j are both connected to k 
given their angular coordinates 9i,9j. Below, we distin¬ 
guish three cases and compute corresponding probabili¬ 
ties piii,j, 9i, 9f, k), P 2 ii,j, 9i, 9j;k) and pj,{i,j, 9„9f,k). 


Case 1: i > j > k. In this case, the connections to k 
happen when j and i first appear, i.e., at times j and i 
respectively. Therefore, 
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1 

pi{t,j,0i,ej;k) = ^ J 


1 


1 


g 2 T ) 1 -|- 6 2T 


dOki where 


( 6 ) 


Xjk = -arccosh [coshCrj cosh(^rk{j) — sinhCr^ sinhCr^Q') cosOjk] 
Xik = -arccosh [cosh Cri cosh Crfc(z) — sinhCci sinhCrfe(z) cos 0^^] 


Rj = ^ In 


2T L 


2 \ 2T li 

. Ri = Ti — - \w - 

C [sinT-TT 


sin Ttt mj{t) _ 

Tk[j) = Prk + (1 - /3)rj, rk{i) = /3rk + (1 - I3)ri. 


r 


Case 2: i > k > j. Here the connection between i and k nection between j and k happens when k first appears, 
happens when i first appears, i.e., at time z, and the con- i.e., at time k. Thus, 


1 

P2(.i,j,0^,ej\k) = — / - 

Jo 1 


1 


1 


~ X 




c?0fc, where 


Xkj = -arccosh [coshCr^ coshCrj(fc) — sinh^rfe sinhCrj(fc) cosfljfe], 
Xik = -arccosh [coshf^r^ coshCrfc(z) — sinh^ri sinhCrfc(z) cos^^/j], 


2 r 2T 4 

— - In ^-^^ ^ 

C [sinTTT TOfc(t) 


, Ri = Ti - -\n 


2T h 
sin TTT fhiit) ’ 


rj{k) = I3rj (1 - P)rk, rk{i) = jJvk + (1 - I3)ri. 


(7) 


Cased: k>i>j. In this final case, both connections with A: happen when fc appears, i.e., at time/c. Therefore, 


1 


1 


1 


g 2T _j_ g 2T Rk) 


dOkj where 


( 8 ) 


7?fc = 'Tfe - ^ In 


Xkj = ^arccosh [coshcosh C^Vj (fc) — sinhsinh C^rj(k) cos Oj^] , 
Xki = ^arccosh [coshCr^ cosh^ri(A:) — sinh([rfc sinhCz’i(A:) cos^fc]) 

, rj{k) = I3rj + {I-I3)rk, ri{k) = jJvi + {I - I3)rk. 


sin Ttt mk{t) 


The integrals in Equations (§-(§ can be only computed 
numerically. Since the connection events are statistically 
independent, the number of common neighbors between 
nodes z and j, j < z, given their angles 4, 9j, is a sum of 
independent Bernoulli trials with different success prob¬ 
abilities, given by Equations Therefore, by the 

Central Limit Theorem m, for sufficiently large net¬ 


work sizes t, the distribution of the number of common 
neighbors riij between z and j is approximately normally 
distributed, i.e., its probability density is approximately 

where its mean fi(i,j,9i,9j) and variance a'^{i,j,9i,9j) 


f(n^j\9i,9j) = 


a{i,j, 9i,9j)y/¥K 
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are 


can compute C\ using Equation ^ 


j-l i-l 

fi{i,j,6„9j) = '^pi{i,j,ei,ej;k) + ^ p2{i, j,0i,ej; k) 


k^l 


A= n = (13) 

l^j<i 






The product goes over all the old nodes j < i. Equa- 

( 10 ) 

tion ( [l^ gives the likelihood that we can use in 
HyperMap in place of in Eq. 0, where is the 
observed number of common neignoors between nodes 
j_i appearing at MLE times i,_), computed from the given 

a^{i,j,9,,9j) = VPi(i, j, dj; fc)(1 - Pi(i, j, 0^; fc)) network adjacency matrix a^-. Note that maximizing 

^CN equivalent to maximizing its logarithm In/^^j^, 


k^l 

i-l 


+ P2{i,j,9^,9,;k){l-p2{i,j,9„9f,k)) , ^ ^ ^ 

t 


+ ^ P 3 (bi;^zj^i;^)(l P3(bj; fc))- -where C = (i — 1 ) In independent of 0 i. 

( 11 ) 


(14) 


k—i-^1 


To compute 9i, 9j) and cr{i,j, 9i, 9j) we use the fact 
that the mean of a Bernoulli random variable with suc¬ 
cess probability p is p, and its variance is p(l — p). The 
computation of p{i,j, 9i, 9j) and 9i, 9j) for each i,j 
pair requires 0{t) steps. 


B. Likelihood and likelihood maximization 


We are now ready to derive the likelihood that we 
can use in place of C\^ in Equation ([^, in order to infer 
the node angular coordinates. 

Consider new node i ^ t in a network that grows 
according to E-PSO up to time t. We denote by 
C\ = /:( 6 »,|ri,{rj(i),%},{nC},TO,L, 7 ,r,C)j<i the like¬ 
lihood that Ps angular coordinate takes value 9^, given 
its ri, the coordinates of the old nodes {rj{i),9j} = 
{ri{i), 9i,r2{i),92, ■. ■ ,rj_i(z), 0 j-i}, the number of com¬ 
mon neighbors between i and each old node j < i at 
the final time t, {nC} = n‘ 2 ,..., nb_i}, and the 

network parameters to, L, 7 , T, Since the distribution 
of the angular coordinates is uniform on [0, 27r], we can 
rewrite £\ using Bayes’ rule as 


£\ = 


27r A 


( 12 ) 


where A = mnlj}\ri,9i,{rj{i),9j},m,L,-f,TX)j<i is 
the likelihood to have the numbers of common neigh¬ 
bors {nC}, if the angular coordinate of node i has value 
9i, conditioned on its radial coordinate, the coordinates 
of the old nodes, and the network parameters. Likeli¬ 
hood £\ = £{{n£}\ri,{rj{i:),9j},m,L,j,T,C)j<i, inde¬ 
pendent of 9i, is the probability that i has the numbers 
of common neighbors with old nodes specified by {nC}, 
conditioned as shown by notation. 

We are looking for the angle 9* that maximizes the 
likelihood £\ in Equation (12), or equivalently, £\. We 


C. HcN versus C\^, and the hybrid method 


As with £^1 Ihe maximization of £cn can be only per¬ 
formed numerically. Th is can be done in the same way 
as with £l (Section IIB), i.e., by sampling the likelihood 
at different values of 9 in [0, 27r] separated by inter¬ 
vals A0 = 1, and then setting 9* to the value of 9 that 


yields the largest value of £cn. (To be more precise, 
we will be using sampling intervals A0 = minjO.Ol, i}.) 
Since, to compute for a given 9 we need to compute 
p{i,j,9i,9j) and a{i,j,9i,9j) between node i and every 
existing node j < i, we need a total of 0{i^t) steps to 
perform the maximization of Aqn. Therefore, if there 
are t nodes in total, HyperMap with requires O(t^) 
running time to map the full network, versus 0(t^) with 


ri 

Likelihoods and £\^ yield different results for the 
first few nodes appearing at early MLE times. Specifi¬ 
cally, all nodes i for which their average number of con¬ 
nections to previous nodes in Eq. (|^ is TOi(t) ^ — 1, are 
expected to be connected to all previous nodes j ^ i — 1 
with a high probability. This condition holds for high 
degree nodes appearing at early MLE times, rendering 
their exact angular coordinate inference with £\^ infea¬ 
sible. This is because £\^ uses the connections and dis¬ 
connections between new and old nodes in order to place 
the nodes at the right angles; if new node i is connected 
to all previous nodes j < i with high probability then 
large zones of different angular coordinates are all quite 
likely with £\^. This effect was noted in [ 3 . In con¬ 
trast, £qj^ can accurately infer the angular coordinates 
of nodes appearing early because it effectively utilizes 
“future” connectivity information as well, i.e., the num¬ 
ber of common neighbors between the nodes at the final 
time t. This important difference between Agpi and £\^ 
is illustrated in Section m We note that since the infer¬ 
ence of the angular coordinates of new nodes appearing 
at later MLE times depends on the inferred angles of 
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high degree nodes appearing early, then if the latter are 
not accurately inferred, the former will not be accurately 
inferred either. 

Given the angular coordinates of high degree nodes ap¬ 
pearing at early MLE times, the inference of the angular 
coordinates of nodes appearing at later MLE times, e.g., 
of nodes i for which fhi{t) < i — 1, using either or 
yields similar results, i.e., the two likelihoods infer ap¬ 
proximately the same angular coordinates for later nodes. 
This effect is also illustrated in Section |V] and it means 
that one can use the following hybrid approach: use 
for the first nodes i for which fhi{t) ^ i — 1, and then use 
for the rest of the nodes for which fhi{t) < i — 1. The 
benefit of this approach is running time, as the number of 
nodes for which rhi{t)^ i—1 is usually quite small, e.g., 
in the order of few tens of nodes. Therefore, HyperMap 
with this hybrid approach will still have 0(t^) running 
time. In the next section, we describe a simple heuristic 
to reduce this running time to 0{t^). 


D. Hybrid method versus C\^ with correction steps 

It was shown in [5] that the accuracy of HyperMap can 
be improved by occasionally running “correction steps” 
right after step 7 in Fig. Specifically, at some prede¬ 
fined set of times i, we visit each existing node j ^ i, 
and having the coordinates of the rest of the nodes 1 ^ i, 
I ^ j, we update j’s angle to the value 0' that maximizes 

= n [1 j, (15) 


where xp is the hyperbolic distance between j and I when 
the youngest of the two nodes appeared, and p(xji) is 
given by Eq. Q, using in it Rj if j > I or Ri if j < 1. 
It has been observed in [5] that these correction steps 
are beneficial when run at relatively small times i. This 
observation fully agrees with our results in this paper. 

These correction steps are a heuristic that tries to ef¬ 
fectively recompute improved angles for the first (high 
degree) nodes, by considering not only the connections 
to their previous nodes, but also connections to nodes 
that appear later, i.e., future connectivity information, 
as in the common-neighbors approach. In Section [V| we 
show that HyperMap with and correction steps yields 
similar results to the hybrid method that does not use 
correction steps. 


IV. SPEEDING UP THE METHOD 

As explained in Section |HI C[ the running time of Hy¬ 
perMap with either the hybrid or link-based approaches 
is 0{t^). Here we introduce a simple heuristic that re¬ 
duces this running time to without significantly 

sacrificing embedding accuracy, as we verify in the next 


section. We first observe that connected nodes are at¬ 
tracted to each other, and are expected to be placed close 
to each other in the angular space [S]. This means that 
for each node i, we can get an initial estimate for its 
angular coordinate, by considering only the previ¬ 
ous nodes j < i in C\^ (Eq. ([^) that are its neighbors. 
This requires only 0{ki) steps, where ki is i’s degree, 
and ki = 0{k) for sufficiently large i. That is, we can 
estimate 0“'* by maximizing the likelihood 

= n 7’(2^b)> (16) 


where the product goes over all previous nodes j < i 
that are i’s neighbors. The maximization of Eq. (161 can 
be performed numerically by sampling the likelihood at 
intervals Ad = 1 as before, yielding a total running time 
of 0{ki) = 0(i)"to find 

Once we estimate 0“'*, we can consider a region around 
it, [6 *“'* —+y]> where 0 < C ^ t is a constant, and 
set the angular coordinate of node i, 0 ijto the value of 9 
that yields the largest value of (Eq. (1^) in this region. 
Since we sample the likelihood at intervals A9 = i, we 
need 0{C) steps to perform this maximization. Taken 
altogether, at sufficiently large times i ^ C we need 
0{iC) = Oil) steps to find 9i. Therefore, if we have 
t^C nodes in total, the total running time to find their 
angles following this procedure is Oit^). The larger the 
value of C the better the results are expected to be in 
general, as we are searching for the optimal value of 9i 
over a larger region, but the procedure will also be slower. 
We validate this speedup heuristic in the next section, 
where we set C = 200, and show that it produces good 
results. 


V. VALIDATION 

In this section we validate the new mapping method 
and its variations. To do so, we first grow synthetic net¬ 
works according to E-PSO up to t = 5000 nodes, with 
m = 1.5, L = 2.5, 7 = 2.1, C = 1, and T = 0.05,0.4,0.7. 
Similar results hold for other parameter values. Then, we 
pass these synthetic networks to HyperMap, using their 
corresponding m, L, 7 , T, C, values, and compute radial 
and angular coordinates for the nodes, using either 
(common-neighbors method), C\^ (link-based method), 
or the hybrid method. We consider the real Internet in 
the next section. 

Inferred versus real angles for nodes appearing 
at early MLE times. Fig. juxtaposes the inferred 
against the real angles for the first 100 nodes, i.e., for the 
nodes that appear at MLE times 1 ^ i ^ 100 for each 
considered network, when or is used. We observe 
that the common-neighbors method is more accurate at 
inferring the angles of these first nodes. The reason for 
this was explained in Section |HI C| Specifically, we see 
in Figs. I^a-c) that can infer the real angles of the 







nodes quite accurately, subject only to a global phase 
shift. This phase shift can take any value in [0, 27r], and 
it is due to the rotational symmetry of the model. The 
exact value of this shift is not important, and it depends 
on the initialization of the angle of the first node in Hy- 
perMap, which can be any random value in [0, 27rl (cf. 
Step 3 in Fig.Q. 

Likelihood landscapes. To gain a deeper under¬ 
standing on the behavior of and we show in 
Fig-i the corresponding likelihood landscapes for dif¬ 
ferent nodes that appear at early MLE times, i = 
5,10,25,30,35,40. To enable comparison between the 
two methods, the link-based likelihood computed 

after fixing the angles of the old nodes j < i to the angles 
inferred by the common-neighbors method. We observe 
that at small i, i = h, 10 , and L\^ behave quite dif¬ 
ferently, achieving their maximum at different values of 
9. As discussed in Section [riIC[ nodes appearing at early 
MLE times are connected to all previous nodes with high 
probability. Therefore large zones of angular coordinates 
are nearly equally likely according to C\^, which is not the 
case with C\^. This difference is evident in the first two 
rows of Fig.j^ showing the landscapes of /1 cn> '^cn 

We also observe that of all possible angular 
coordinates is quite high for early nodes: of any angle 

is above 99%, and is above 92% for all angles. 

At larger times i, i ^ 25, the two likelihoods achieve 
their maximum around the same angle, while their land¬ 
scapes vary in a somewhat simi lar ma nner. This justifies 
the hybrid approach of Section III C which uses to 
infer the angles of the first i nodes for which fhi{t) ^ i—t, 
and then fo infer the angles of the rest of the nodes. 
For the considered networks, relation fhi(t) i — 1 holds 
only for the first 33 nodes, while for the AS Internet snap¬ 
shots in the next section it holds only for the first 36-40 
nodes. 


Inferred versus real angles for all the nodes. 

Fig. i juxtaposes the inferred against the real angles 
for all nodes in each considered network, when the hy¬ 
brid and link-based methods are used. We observe that: 
(i) the hybrid method is more accurate than the link- 
based method, as expected; (ii) Figs. Qa-c) are similar 
to Figs, [^a-c), meaning that as long as the angular co¬ 
ordinates of the first few nodes are accurately inferred, 
then the angular coordinates of the rest of the nodes will 
also be accurately inferred; (iii) the inference is in gen¬ 
eral better at lower temperatures T; and (iv) that the 
inference is in general better for higher degree nodes ap¬ 
pearing at early MLE times, cf. Figs, [^a-c) and[^a-c). 


Connection probability. In Fig. we report the 
connection probability, which is the probability that 
there is a link between a pair of nodes located at hy¬ 
perbolic distance x, using real and inferred node coordi¬ 
nates. This probability is computed as the ratio of the 
number of connected node pairs to the total number of 
pairs of nodes located at distance x. From the figure, we 
observe that all inferred connection probabilities are close 


to the real ones, except from some discrepancies at their 
tails, which are more pronounced at lower T’s. Further¬ 
more, we see that the results with the hybrid method are 
only slightly better compared to the link-based method 
in terms of the connection probability. This suggests that 
the link-based method also produces relatively good map¬ 
pings, even though it cannot infer as well the real angular 
coordinates. 

We also quantify the quality of the obtained mappings 
using two other metrics: (i) the logarithmic loss, and 
(ii) the performance of greedy routing. 

Logarithmic loss. The logarithmic loss is a quality 
metric for statistical inference defined as LL = —ln£, 
where C in our case is the global likelihood 

C= p{x^{t))°^'^ [I - . (17) 


The product goes over all node pairs i,j in the net¬ 
work, Xij(t) is the hyperbolic distance between pair i,j, 
and p{xij{t)) = 1/(1 + jg connection 

probability. We use LL to quantify the quality of the 
inference of the node angular coordinates. Specifically, 
we first compute LL using the inferred node coordinates 
{ri{t),9i}, and then compare the result to the case where 
LL is computed using the inferred ri{tys and random 9i’s 
drawn uniformly from [0, 27r]. We denote the former by 
LL™^ and the latter by The smaller the 

compared to the better the quality of the map¬ 
ping. In particular, the ratio = 

g(LL -LL ) jg ratio of the likelihood with the in¬ 
ferred angular coordinates to the likelihood with random 
angular coordinates. The higher this ratio, the better the 
mapping quality. Table reports the logarithmic losses 

LU^y ^g .^^g^ J^g 2,ireal -g 

the logarithmic loss if we use the real radial and angu¬ 
lar coordinates of nodes. We observe that: (i) the hy¬ 
brid method yields lower logarithmic losses compared to 
the link-based method, which is expected since it infers 
the node angular coordinates more accurately; and (ii) 
that the logarithmic losses for both methods are signif¬ 
icantly lower than those obtained with random angular 
coordinates, and closer to the logarithmic losses obtained 
with the real coordinates. These results suggest that the 
link-based method yields relatively good results, but the 
hybrid approach is better, as expected. 

Performance of greedy routing. One specific class 
of network functions that are impossible without un¬ 
derlying geometry are efficient targeted transport pro¬ 
cesses without global knowledge of the network structure. 
Many real networks have this routing or navigation func¬ 
tion in common; in some networks, including the Inter¬ 
net, this function is their primary function |I4j . There¬ 
fore navigability can be used as an alternative indirect 
metric of embedding quality. Navigability of an embed¬ 
ding is also of independent interest for some applications, 
such as Internet routing [5]. A network embedded in a 
geometric space is said navigable if greedy routing (GR) 
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FIG. 2: Inferred vs. real angles (in radians) for synthetic networks with t = 5000 nodes and parameters m = 1.5, L = 2.5 ,7 = 2.1, 
and T as shown in the captions. The plots juxtapose the inferred against the real angles for the first 100 nodes, i.e., the nodes 
that appear at MLE times 1 sC i ^ 100. In (a-c) the common-neighbors method is used, while in (d-f) the link-based method 
is used. 


TABLE I: Logarithmic Losses in the mappings of Fig. 


Network 

^^real 

hybrid 

link-based 

^^rand 

tll, hybrid 

tll , link-based 

T = 0.05 

1.1 X 10'‘ 

9.6 X 10"^ 

24.8 X 10'‘ 

123 X lO"* 

^1134000 

^982000 

0 

II 

2.4 X 10'‘ 

3.5 X 10"^ 

5.4 X 10^ 

17 X lO"* 

gl35000 

giieooo 

r = o.7 

4.1 X 10'‘ 

4.4 X 10"^ 

5.2 X 10‘‘ 

11 X 10'‘ 

g66000 

gSSOOO 


is efficient according to the metrics considered below. In 
GR, a node’s address is its coordinates in the space, and 
each node knows only the addresses of its neighbors, and 
the destination node address of a “packet”. Upon receipt 
of such a packet, the GR node, if it is not a destination, 
forwards the packet to its neighbor closest to the desti¬ 
nation in the geometric space, and drops the packet if a 
local minimum loop is detected, i.e., if this neighbor is 
the same as the previous node visited by the packet. 

We evaluate the efficiency of GR in the synthetic net¬ 
works of Fig. 1^ using both the HyperMap-inferred (hy¬ 
brid, link-based) and the real node coordinates. We con¬ 
sider the following two GR efficiency metrics mi: (i) 
the percentage of successful paths, Ps, which is the pro¬ 
portion of paths that do not get looped and reach their 
destinations; and (ii) the average hop-length h of the suc¬ 
cessful paths. The results are shown in Table |IT1 where 
we see that: (i) both the hybrid and link-based methods 
yield mappings where GR is_quite efficient, yielding high 
Ps’s and low path lengths h, as it is the case with the 
real node coordinates; and (ii) that the hybrid method 


performs better, as expected, especially at lower T’s. 

Correction steps. We now repeat the same experi¬ 
ments applying the link-based and hybrid methods with 
correction steps, in order to investigate the differences. 
Specifically, for each method we run 4 correction steps 
as described in Section HTTOJ right after all nodes with 
degrees k ^ 60,40,20,10 appear in the network. Each of 
these correction steps is repeated 8 times, which equals 
the average degree k in each network. In the hybrid 
method, the node angular coordinates that were inferred 
using the common-neighbors approach are not altered by 
the correction steps. 

Figures d-f) show that the node angular coordinates 
are now inferred quite accurately with the link-based 
method. Figures [^a-c) show the results for the hybrid 
method, which look similar to Figs. |^a-c); this means 
that the effect of correction steps in this case is not as sig¬ 
nificant. All results are in agreement with Sections |III G| 
and EnDl In all cases, the inference is better at lower 
T’s, as in Fig. 

The corresponding connection probabilities are shown 






























10 







FIG. 3: Likelihood landscapes for different nodes in a synthetic net work with t = 5000 nodes and parameters m = 1.5, L = 
2.5,7 = 2.1,T = 0.4. The plots show the likelihoods Hcn, jCl (Eqs. (13l,([^) and the log-likelihoods In/f^N, In^Li for nodes 
appearing at MLE times i = 5, 10, 25, 30,35,40, as a function of the angular coordinate 6 (in radians). The vertical line in each 


plot shows the inferred angle 0“ 


(in radians), which always corresponds to the global maximum of the likelihood. 
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FIG. 4: Inferred vs. real angles (in radians) for all the nodes in the synthetic networks of Fig.|^ In (a-c) the hybrid method is 
used, while in (d-f) the link-based method is used. 



(a) T = 0.05. (b) T = 0.4. 
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FIG. 5: Connection probabilities with inferred (radial and angular) node coordinates obtained by the hybrid and link-based 
methods, and with real node coordinates. The results correspond to the mappings of Fig. 


in Fig.[^ Compared to Fig.[^ we observe that correction 
steps can help to better capture the connection probabil¬ 
ity tail, in both the hybrid and link-based methods. Fi¬ 
nally, the logarithmic losses and the performance of GR 


are reported in Tables pll| and IV In Table O we observe 
that all logarithmic losses are smaller compared to those 
in Table |Tj and even closer to the logarithmic losses ob¬ 
tained with the real coordinates. This means that correc¬ 
tion steps improve the quality of the obtained mappings 
in all cases. The improvement is quite significant for the 
link-based method, as expected, which at lower tempera¬ 
tures yields even lower logarithmic losses than the hybrid 
method. From Table [rV] we see that the efficiency of GR 
is better compared to the results in Table [III especially 
for the link-based method. We also note from Tables m 
and [lT| that in some high-temperature cases, GR with in¬ 
ferred node coordinates performs even better than GR 


with real node coordinates. A possible explanation for 
this effect is given in Section VIII of [5]. 

Fast methods. Finally, we present results with the 
speedup heuristic described in Section [TV] where we set 
constant C = 200 (Section [rv|). We consider the hybrid 
and link-based methods with correction steps as before, 
and we run the speedup heuristic for all nodes with de¬ 
grees k < fcgpeedup = 10. We call these versions of the 
methods fast versions. Figure shows likelihood land¬ 
scapes sampled by the link-based method, and the cor¬ 
responding regions of the likelihoods sampled by its fast 
version. We observe that the fast version infers the same 
angle as the original version, which always corresponds 
to the maximum of the likelihood. We also observe that 
the initial estimate of the angle is very close to the fi¬ 
nal inferred angle, as expected. Figure [^juxtaposes the 
inferred angles with the original and fast version of the 
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TABLE II: Success ratio ps and average hop-length h of greedy paths in the mappings of Fig. 


Network 

using real coordinates 

using inferred coordinates, hybrid 

using inferred coordinates, link-based 

T = 0.05 

Ps = 0.99, h = 3.0 

Ps = 0.96, h = 3.1 

Ps = 0.82, h = 3.3 

T = 0.4 

Ps = 0.94, h = 3.2 

Ps = 0.95, h = 3.4 

Ps = 0.87, h = 3.5 

o 

II 

Ps = 0.77, h = 3.5 

Ps = 0.92, h — 3.8 

Ps = 0.89, h = 3.9 



steps. steps. steps. 



(d) T = 0.05, link-based with (e) T = 0.4, link-based with (f) T = 0.7, link-based with 

correction steps. correction steps. correction steps. 


FIG. 6: Inferred vs. real angles (in radians) for all nodes in the networks of Fig. In (a-c) the hybrid method is used, while 
in (d-f) the link-based method is used. In both methods the correction steps are run as described in the text. 
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FIG. 7: Connection probabilities with inferred (radial and angular) node coordinates obtained by the hybrid and link-based 
methods, and with real node coordinates. The results correspond to the mappings of Fig. 


TABLE III: Logarithmic Losses in the mappings of Fig. 


Network 

^^real 

hybrid 

link-based 

^^rand 

tll, hybrid 

rLL, link-based 

T = 0.05 

1.1 X 10'‘ 

5.5 X 10"^ 

3.9 X 10"^ 

123 X lO'* 

^1175000 

^1191000 

T = 0.4 

2.4 X lO'^ 

2.9 X 10"^ 

2.8 X 10^ 

17 X lO"* 

gl41000 

gl42000 

r = o.7 

4.1 X 10'‘ 

4.1 X 10'‘ 

4.1 X 10‘‘ 

11 X 10'‘ 

g69000 

g69000 
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TABLE IV: Success ratio ps and average hop-length h of greedy paths in the mappings of Fig. 


Network 

using real coordinates 

using inferred coordinates, hybrid 

using inferred coordinates, link-based 

T = 0.05 

Pa = 0.99, h = 3.0 

Pa = 0.97, h = 3.1 

Pa = 0.98, h = 3.1 

T = 0.4 

Pa = 0.94, h = 3.2 

Pa = 0.96, h — 3.3 

Pa = 0.97, h = 3.3 

L- 

0 

II 

Pa = 0.77, h = 3.5 

Pa = 0.93, h — 3.7 

Pa = 0.93, h = 3.7 


TABLE V: Logarithmic Losses obtained by the fast version 
of the methods (with correction steps) 


Network 

, fast hybrid 

fast link-based 

T = 0.05 

6.2 X 10"^ 

4.0 X 10'‘ 

T = 0.4 

3.0 X 10^ 

2.9 X 10'‘ 

0 

II 

4.2 X 10"^ 

4.1 X 10'‘ 


TABLE VI: Success ratio Pa and average hop-length h of 
greedy paths obtained by the fast version of the methods (with 
correction steps) 


Network 

fast hybrid 

fast link-based 

T = 0.05 

Pa = 0.97, h = 3.1 

Pa = 0.98, h = 3.1 

T = 0.4 

Pa = 0.96, h = 3.3 

Pa = 0.97, h = 3.3 

T = 0.7 

Pa = 0.91, h = 3.7 

Pa = 0.92, h = 3.7 


hybrid method for all the network nodes. Similar results 
hold for the link-based method. From the figure, we ob¬ 
serve a very good match for almost all the node angles, 
especially at lower temperatures. Tables [V| and |VI| show 
the logarithmic losses and the performance of GR, where 
the results are very similar to those in Tables [Till and [TV] 

Summary of the results. To summarize, in this sec¬ 
tion we have validated that: (i) the common-neighbors 
method is more accurate than the link-based method for 
nodes appearing at early MLE times; (ii) at larger MLE 
times, the two methods yield approximately the same re¬ 
sults; (iii) the hybrid method performs significantly bet¬ 
ter from the link-based method if correction steps are 
not used; (iv) if correction steps are used, then hybrid 
and link-based methods perform similarly; (v) correction 
steps can help improve the quality of the obtain mappings 
in all cases, but their effect on the hybrid method is not as 
significant as in the link-based method; and (vi) the fast 
and original versions of the methods perform almost the 
same. Our results indicate that the best options are the 
fast versions of either the hybrid or link-based methods 
with correction steps. However, we note that the cor¬ 
rection steps are an ad-hoc and computational intensive 
heuristic, requiring 0{i^) computations if run at time i. 
We have observed that these steps are beneficial when run 
at relatively small times i, not exceeding a few hundred 
nodes [5]. But being a heuristic, there are no universal 


guidelines of when exactly they should be invoked on a 
given real network to be embedded with the best results. 
Since correction steps do not have a significant effect on 
the hybrid approach, the fast hybrid method without cor¬ 
rection steps might be the best option in general in terms 
of accuracy and computational complexity tradeoffs. 

VI. APPLICATION TO THE INTERNET 

We now consider the Autonomous Systems (AS) In¬ 
ternet topology extracted from the data collected by the 
Archipelago active measurement infrastructure (ARK) 
developed by CAIDA na, which is available at m- 
The connections in the topology are not physical but 
logical, representing AS relationships [16]. Specifically, 
an AS is a part of the Internet infrastructure admin¬ 
istrated by a single company or organization. Pairs 
of ASes peer to exchange traffic. These peering rela¬ 
tionships in the AS graph are represented as links be¬ 
tween AS nodes. CAIDA’s IPv4 Routed /24 AS Links 
Dataset m provides regular snapshots of AS links de¬ 
rived from ongoing traceroute-based IP-level topology 
measurements. A detailed description of the measure¬ 
ment process is given in m- The AS topology has a sta¬ 
ble power law degree distribution with exponent 7 = 2 . 1 , 
average node (AS) degree « 5 and average cluster¬ 
ing c « 0.6. We consider 6 snapshots of the topology 
spaced by 3-month intervals from September 2009 to De¬ 
cember 2010. These snapshots consist respectively of 
t = 24091,25910,26307,26756,28353,29333 ASes. 

Logarithmic Loss and greedy routing efficiency. 
In Fig. we mapped the Sept. 2009 snapshot using the 
fast hybrid and link-based methods with and without cor¬ 
rection steps. The correction steps were applied as de¬ 
scribed in the previous section. In all cases we used the 
estimated m = 1.5, L = = 1 , 7 = 2.1,^ = 1, and 

different values of T in [0.1,0.9]. The speedup heuristic 
was applied for all nodes with degrees k < A:speedup = 3. 
Figs. [To|(a),(b) show the obtained logarithmic losses and 
the efficiency of greedy routing (GR) in all cases. We 
observe that correction steps do not have a significant ef¬ 
fect on the hybrid method whose lowest logarithmic loss 
is obtained at T = 0.6. This value is close to the value 
T = 0.45-0.5 required to construct synthetic networks 
with the same clustering c as in the Internet |5]. The 
link-based method without correction steps yields signifi¬ 
cantly higher logarithmic losses than the hybrid method, 
for almost all temperature values T. These losses de- 






















14 





FIG. 8: Likelihood landscapes for different nodes in a synthetic network with t = 5000 nodes and parameters m = 1.5, L = 
2.5 ,7 = 2.1, T = 0.4. The plots show the log-likelihoods i = 600,1000, 2000, with the original version of the method that 

samples the likelihood over the whole [0, 27r] domain (dashed red line), and with its fast version that samples the likelihood 
only over the 0-region shown by the solid black line. The vertical line in each plot shows the initial estimate for the angle, 
while jg inferred angle. 



steps. steps. steps. 

FIG. 9: Inferred angles (in radians) with the original and fast versions of the hybrid method for synthetic networks with 
t = 5000 nodes, m = 1.5, L = 2.5,7 = 2.1, and T as shown in the captions. 


crease when correction steps are used and become sim¬ 
ilar to the ones in the hybrid method. These results 
agree with our observations in the previous section on 
synthetic networks, which indicated that the link-based 
method without correction steps is not as accurate at 
inferring the angular coordinates of nodes, while correc¬ 
tion steps are not as important for the hybrid method, 
cf. Figs. Si and Tables EllnT} 

GR is also very efficient. In the hybrid method, with 
or without correction steps, the success ratios are close to 
90% for a wide range of T in [0.3,0.6]. In the link-based 
method without correction steps the success ratios are 
smaller, and become similar to the hybrid method’s only 
if correction steps are used. These results agree again 
with our previous observations on synthetic networks, cf. 


Tables [g [Tv 


Prediction of future links. Fig. 10 [c) shows the 


empirical probability that a future link appears between 
two disconnected ASes as a function of their hyperbolic 
distance in Sept. 2009. To compute this probability, we 
consider all disconnected AS pairs in Sept. 2009 and all 
future links that appear between these pairs in the pe¬ 
riod Sept. 2009-Dec. 2010 (48119 new links). We then 
bin the range of hyperbolic distances between these pairs 
from zero to the maximum distance into small bins. For 


each bin we find all the disconnected pairs located at the 
hyperbolic distances falling within the bin. The percent¬ 
age of pairs in this set of pairs that get connected with 
a future link, is the value of the empirical future-link 
probability at the bin. From Fig. [l0|(c), we observe that 
this probability decreases with the hyperbolic distance 
between disconnected ASes, as expected. Furthermore, 
this decrease is exponential at large distances. We note 
that the shape of this probability is similar to the con¬ 
nection probability in our model, cf. Fig. but it has a 
slope that does not depend on T; in fact, different values 
of T ^ 0.7 yield very similar results. 

To provide a deeper insight on the ability of the fast 
hybrid and link-based methods to predict future links, 
we also compute the Area Under the Receiver Operat¬ 
ing Characteristic Curve (AUC) |17j . The AUC here is 
defined as the probability that a randomly selected link 
from the set of our future links is given a better score (i.e., 
a higher existense likelihood) than a randomly selected 
nonexistent link, where the “nonexistent links” are the 
disconnected AS pairs in Sept. 2009 that never get con¬ 
nected in Sept. 2009-Dec. 2010. The score Sij between 
two disconnected ASes {i,j) is the hyperbolic distance Xij 
between them. The smaller this score, i.e., the smaller 
the hyperbolic distance between two disconnected ASes, 
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FIG. 10: Logarithmic Loss GR success ratio (ps), and future-link probability in a mapped snapshot of the AS Internet 

(Sept. 2009 snapshot). In plots (a) and (b) the results are obtained by the fast hybrid and link-based methods, with and 
without correction steps. The results in plot (c) are obtained by the fast hybrid method with correction steps. In all cases 
^speedup = 3, and the results are shown for different values of the temperature parameter T. 


the more likely it is that these two ASes will get con¬ 
nected, cf. Fig. 10 c). The degree to which the AUC 


exceeds 0.5 indicates how much better the method per¬ 
forms than pure chance, while AUC = 1 is the best pos¬ 
sible AUC. 

The results are shown in Fig. [T^a) for different values 
of T, and are juxtaposed to the results obtained with the 
Preferential Attachment (PA) and Common-Neighbors 
(CN) heuristics [T7j. In PA, the score between two dis¬ 
connected ASes {i,j) is Sy = ki x kj, where ki,kj are 
the degrees of the ASes, while in CN stj = riij , where riij 
is the number of common neighbors between the ASes. 
The higher these scores the higher the chance of a future 
link between the disconnected ASes. From Fig. [IT|)a), 
we observe that the fast hybrid and link-based methods 
yield very high AUC values, around 0.97 for almost all 
T, outperforming the PA and CN heuristics. Note that 
hybrid and link-based methods perform similarly with re¬ 
spect to this performance metric. This is not surprising, 
since as we have seen in the previous section, the re¬ 
sulting connection probabilities in the two methods are 
quite similar, cf. Figs. m In particular, even though 
the link-based method without correction steps is not 
as accurate at inferring the real angular coordinates of 
nodes, cf. Figs.|^d-f), its resulting connection probabili¬ 
ties are close to the ones obtained by the hybrid method, 
cf. Fig. That is, these results also agree with our pre¬ 
vious observations on synthetic networks. In Fig. |llD b), 
we compute the AUC by considering only disconnected 
AS pairs with no common neighbors and the future links 
among these pairs. In this case, CN performs as good 
as pure chance since it assigns a zero score to all the 
pairs, while the fast hybrid and link-based methods still 
perform remarkably well, with AUC values between 0.89 
and 0.92. Finally, in Fig. we compute the AUC by 

considering only disconnected AS pairs with low degrees, 
less than the average degree k = 5, and the future links 
among these pairs. The figure shows that the methods 
still perform very well, with AUC values between 0.79 
and 0.85 for T ^ 0.8, significantly outperforming the PA 


and CN heuristics. 

To summarize, our results indicate that our methods 
have a very strong predictive power. Specifically, they 
perform remarkably well not only in predicting the “easy- 
to-predict” future links, i.e., the links that appear among 
nodes with high degrees or many common neighbors, but 
also in predicting the “hard-to-predict” future links, i.e., 
the links that appear among nodes with low degrees or 
no common neighbors. In that sense one can say that 
the measure of proximity (hyperbolic distances) between 
nodes in our approach reflects reality more accurately 
than the PA and CN approaches do, and that our meth¬ 
ods can infer these distances in the real Internet quite 
accurately. The predictive power of our methods is not 
very sensitive to the exact value of T, with the best re¬ 
sults obtained for T < 0.8, cf. Fig. 

Evolution of soft AS communities. In Fig. [T^ we 
map our 6 AS snapshots, using the fast hybrid method 
with correction steps as before, with T = 0.6 that yielded 
the lowest logarithmic loss and fcgpeedup = 3. In all cases, 
the angle 6i of node z = 1 (see step 3 of Fig. is fixed 
to 6*1 = TT. Figs. [I2](a-f) show that the method pro¬ 
duces meaningful mappings, in the sense that the method 
infers soft communities of ASes belonging to the same 
country, where by soft communities we mean groups of 
nodes located close to each other in the space. For each 
mapped snapshot. Fig. shows the angular distribu¬ 
tion of ASes belonging to the same country for 20 differ¬ 
ent countries. For comparison among the distributions, 
for each snapshot after Sept. 2009 we consider only the 
ASes that were also present in Sept. 2009. The cc-axis in 
Figs. [T^a-f) (angular coordinate) uses bins of size 3.6°. 
The AS-to-country mapping is taken from the CAIDA 
AS ranking project [18]. We observe that the fast hy¬ 
brid method places ASes belonging to the same coun¬ 
try close to each other in the angular space. The rea¬ 
son for this is that ASes belonging to the same country 
tend to connect more densely to each other than to the 
rest of the world. Connected ASes are attracted to each 
other, while disconnected ASes repel, and the fast hy- 
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(a) 


(b) 


(c) 


FIG. 11: Performance of future-link prediction in the AS Internet with the fast hybrid and link-based methods (fcspeedup = 3), 
and comparison to the Preferential Attachment (PA) and Common-Neighbors (CN) heuristics. In each case, the AS snapshot 
of Sept. 2009 is considered. In plot (a), the AUC is computed over all disconnected AS pairs and the new links that appear 
between them in Sept. 2009-Dec. 2010 (48119 new links); in plot (b), the AUC is computed only over the disconnected AS 
pairs that have no common neighbors (95% of all disconnected pairs) and the new links between them (9279 new links); and in 
plot (c), the AUC is computed only over the disconnected AS pairs with degrees k,k' < fc = 5 (72% of all disconnected pairs) 
and the new links between them (2050 new links). 


brid method feels these attraction/repulsion forces, plac¬ 
ing groups of densely connected ASes in narrow regions, 
close to each other. As expected, due to significant geo¬ 
graphic spread in ASes belonging to the US, these ASes 
are more widespread. We note that other reasons be¬ 
sides geographic proximity may affect the connectivity 
between ASes, such as economical, political, and perfor¬ 
mance related reasons. The mapping method does not 
favor any specific reason but relies only on the connectiv¬ 
ity between ASes in order to place the ASes at the right 
angular (and consequently hyperbolic) distances. 

Figs. [T^g-i) also show how the angular center of 
masses of the considered AS communities evolve in the 
similarity space during the period Sept. 2009-Dec. 2010. 
We observe that some communities, e.g., USA and sev¬ 
eral European countries, have a more stable position in 
this space than others, e.g., Argentina and Brazil. The 
observed dynamics in the similarity space is likely due 
to a combination of two classes of factors: 1) stochastic 
fluctuations and noise coming from the data (our map¬ 
ping does not introduce any additional randomness since 
the algorithm is deterministic), and 2) real dynamics of 
nodes and communities in the similarity space, caused 
by new connections and disconnections within and across 
the communities. Similar results hold for the link-based 
method with correction steps. 


VII. OTHER RELATED WORK 

A different mapping of the AS Internet to the hyper¬ 
bolic plane was performed in m- The authors found 
that the hop lengths of the shortest AS paths in the In¬ 
ternet can be embedded into the hyperbolic plane with 
low distortion, and that the resulting embedding can be 
used for efficient overlay network construction and ac¬ 
curate path distance estimation. Our work is different 


from m, in that hyperbolic distances between ASes in 
our case are not directly defined by their “observable” 
AS path lengths. Instead, they are defined by “hidden” 
popularity and similarity node coordinates that manifest 
themselves indirectly via the nodes’ connections and dis¬ 
connections. As indicated by the performance of greedy 
routing in Section |Vj short paths follow well the under¬ 
lying hyperbolic geodesics in our mappings. However, 
nodes at short path distances are not always hyperboli- 
cally closer than nodes separated by longer paths. For the 
same reason, our approach differs from multidimensional 
scaling (MDS) techniques, which try to compute coor¬ 
dinates for points in low dimensional geometric spaces, 
e.g., [20], such that the distances between the points in 
these spaces match as closely as possible some given dis¬ 
tances between the points. 

Besides perhaps the most relevant earlier work 

is m- In this work, the authors considered a model 
of social networks where nodes reside in a latent Eu¬ 
clidean space |22|- Nodes that are sufficiently close in 
this space have higher chances of being connected. Based 
on this model, the authors presented a combined MDS 
and maximum likelihood estimation (MLE) procedure 
for inferring the node coordinates in the latent space. 
The procedure can take into consideration previously es¬ 
timated node positions, e.g., estimated node positions in 
a previous closely-spaced network snapshot, and penal¬ 
ize large displacements from these positions, in an at¬ 
tempt to yield more accurate embeddings. The authors 
applied this procedure to create embeddings for link pre¬ 
diction, and to illustrate how relationships between au¬ 
thors in co-authorship data change over time. The main 
difference of our work from m is that in our case the 
latent space is not Euclidean but hyperbolic, the latter 
providing a more accurate reflection of the geometry of 
real networks HISIII]. In contrast to earlier work on la¬ 
tent network geometry inference, here we have departed 
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FIG. 12: Distributions of angular coordinates of ASes belonging to the same country during September 2009-December 2010 
(a-f), and the evolution of the angular center of masses of the corresponding communities over time (g-i). For each snapshot 
in (a-f), the angular center of mass of each country is 6cm = where n is the number of ASes belonging to 

the country, n{b) is the number of such ASes falling within bin b, 9{b) is the value of 9 in the bin, and the summation is over 
all the bins. For each country, 9cm is shown (g-i) as a function of the network time t, i.e., as a function of the number of ASes 
in the snapshots (a-f), t = 24091, 25910, 26307, 26756, 28353, 29333, respectively. 


from the traditional link-based inference methods, and 
based our inference entirely on a higher-order similarity 
statistics—the statistics of the number of common neigh¬ 
bors between nodes. 


VIII. CONCLUSION 

In summary, we have introduced and explored a new 
method for inferring node similarity coordinates based 
on the number of common neighbors between nodes, and 
have released the software package implementing this 
network mapping method to public [8]. We have shown 
that this approach is more accurate than the link-based 
approach [3, unless heuristic periodic adjustments (or 
correction steps) are used. The common-neighbors ap¬ 
proach is more computationally intensive, but we have 
devised a hybrid method that combines the common- 
neighbors and link-based approaches, and showed how 


to reduce its running time to 0{t^). The correction steps 
can be used in this hybrid approach as well, but their 
effect is not significant. Therefore they can be entirely 
avoided to reduce running time. We have validated this 
method on synthetic model networks, and applied it to 
the evolving AS Internet. Taken altogether, our results 
advance our understanding of how to efficiently and accu¬ 
rately map real networks to their underlying hyperbolic 
spaces. 

An interesting open problem is whether more compu¬ 
tationally efficient but also more sophisticated numeri¬ 
cal optimization methods |23j can be applied to the la¬ 
tent network geometry inference problem. Such methods 
may expedite the ma ximization of the likelihoods C\^ and 
in Eqs. ([^, |l^ , without sacrificing the embedding 
quality. We note that our “brute-force” approach of sam¬ 
pling the likelihoods at small A6 intervals in order to find 
their global maximum appears currently to be the best 
option among all other methods that we have invest!- 
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gated. These methods [23] tend to work reliably only if 
the function to maximize is relatively smooth, has only 
one easily detectable global maximum, or only few lo¬ 
cal maxima. In contrast, the likelihood profiles we have 
to deal with. Figs. [3l8l are very rugged and rough, abun¬ 
dant with sharp local maxima, rendering unusable all the 
other methods that we have experimented with. 

All the inference methods presented here and in [3113] 
use the uniform distribution as the prior [24] for the an¬ 
gular distribution of nodes, Eq. (12). This means that 
the methods do not make any prior assumption about 
the node angular positions. Instead, they assume that 
all positions are equiprobable, and let the given data, 
i.e., the given network adjacency matrix, to determine 
the positions. The distributions of the inferred angular 
coordinates can then be non-uniform in mappings of real 
networks produced by these methods, since many real 
networks tend to have some nontrivial community struc¬ 
ture. For example. Figure [^ shows the distribution of 
the inferred AS angles in September 2009. The lowest 
logarithmic loss (Fig, [l^a)) is achieved at T = 0.6, and 
the corresponding distribution of angular coordinates is 
clearly non-uniform. In this context, an interesting open 
problem is to consider extensions of network geometry 
models that are capable of explaining the emergence of 
soft community structure in networks and non-uniform 
distribution of nodes in the similarity space, e.g., pS] . 
and to develop mapping methods for such models that 
would use non-uniform priors. 


Finally, given an efficient and accurate method to map 
real complex networks into their underlying hyperbolic 
spaces, one of the most interesting open problems is to 
decipher the laws that govern the dynamics of nodes in 
these spaces. Fig. [^ As real networks are character¬ 
ized by a hierarchical organization and nontrivial com¬ 
munity structure mm, we expect this dynamics to be 
also highly nontrivial, but definitely not random. This 
observation suggests that it might be possible to accu¬ 
rately predict the future positions of nodes in the under¬ 
lying hyperbolic spaces. The precise knowledge of this 
spatial dynamics of nodes can then be used to predict 
fine-grained network dynamics, forecasting future con¬ 
nections and disconnections among nodes over different 
timescales. 
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(a)T = 0.1. (b)T = 0.3. (c) T = 0.6. (d) T = 0.8. 


FIG. 13: Distribution of the inferred AS angles (Sept. 2009 snapshot) with the fast hybrid method (fcspeedup = 3) and different 
values of the temperature parameter T. 
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